Review of Nelson's analysis of Bell's theorem 

Benjamin Schulz, * 
February 19, 2013 



Abstract 

This article contains a review of Nelson's analysis of Bell's theorem. It shows that Bell's 
inequalities can be violated with a theory of local random variables if one accepts that the outcomes 
of these variables are not predetermined prior to measurement. Furthermore, there can be no model 
of the singlett state at all where the outcomes do not explicitely depend on the settings of the 
measurement devices. The article describes the relations between Bell's theorem and the Free Will 
theorem of Conway and Kochen. The article closes by relating the various assumptions needed to 
derive Bell's theorem with the reality criterion of EPR 
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1 Introduction 



In 1935, Einstein, Podolsky and Rosen (EPR) wrote an article in which they denied that quantum 
theory would be a complete theory of nature [T|. Around 1951, Bohm gave a more testable outline 
of the so-called "EPR paradox" [2j|3]. He described a thought experiment with one source that 
ejects particles having opposite spin to two spatially separate Stern-Gerlach magnets of variable 
orientations, see Fig. [T] Then, in 1964, Bell published a theorem about this paradox in the form 
of an inequality. It made clear that hidden variable theories fulfilling certain conditions would 
contradict quantum mechanics. Bell called these conditions "locally causal" and explained them 
in detail in [?]. 

In his first publication on that topic, Bell wrote: "If hidden parameters would be added to 
quantum mechanics, there must be a mechanism, whereby the setting of one measuring device 
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Figure 1: Drawing of an EPR experiment with Stern-Gerlach 
magnets 1, 2 and axes jl, v. The black dots illustrate the 
particles moving to the detectors. Their spin direction with 
respect to the z axis is indicated by the small arrows around 
them. 
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Figure 2: Space-time diagram of 
an EPR experiment. The dashed 
arrows display particle trajectories 



can influence another spatially separated device" and the signal involved has to "propagate in- 
stantaneously" [5]- However, Bell's first contribution underwent several modifications. Over the 
years, more and more instructive proofs of his inequality were constructed by him and others. A 
collection of all of Bell's fundamental articles can be found in 0. Finally in 1969, Clauser, Holt, 
Shimony and Home (CHSH) brought the inequality of Bell into a form suitable for experimental 
investigation [8]. 

In 1985, Nelson tried to analyse Bell's theorem with full mathematical rigour. The result of 
his analysis was that Bell's definition of "locally causal", which is the starting point to derive 
Bell's inequality, could be divided into two separate conditions. Both were necessary to derive the 
inequality, but only one of them has to hold if in a hidden variable theory an outcome depends 
on changes made at a spatially separated location. Nelson published his result twice [9j IIP] , 
(However, a small correction was added later [11], in order to make Nelson's theorem compatible 
with Mermin's presentation [12].) Furthermore, there are studies from other mathematicians, 
for example [13], which give further insightful analysis of Nelson's work. Unfortunately, Nelson's 
articles on Bell's inequality got almost overlooked by physicists. The reason for this might be that 
Nelson found his own interpretation of quantum mechanics [141 115] to be at variance with the 
requirements of his theorem for a model without instantaneous signalling effects. 

This article is organised as follows: In section 2, we review Nelson's contribution towards 
a mathematically rigorous understanding of Bell's inequality. The article will need some un- 
derstanding of mathematical probability theory. For a general introduction to this theory, see 

nsi nzi nn us] 

2 Nelson's analysis of Bell's theorem 
2.1 The setup of the EPR experiment in theory 

An EPR experiment consists of two measurement devices 1 and 2 which are space-like separated, as 
well as a particle source in the intersection of their past cones. The source ejects pairs of particles 
to the two detectors. At 1 and 2, the direction of the spins is measured with a Stern-Gerlach 
magnet. These magnets can be rotated around arbitrary directions. The direction of the magnets 
is called p. for detector 1 and v for detector 2 (see Figs. [1] [2} . 

When the particles arrive in the Stern-Gerlach magnets, the magnetic field of these devices 
could change the particle's properties, including the spin. In an EPR experiment, the parts of 
the detectors that might influence the particles are the axes pi and v. Those axes can be chosen 
by the experimenter freely at will. Due to the Stern-Gerlach magnets, we have to describe our 
measurement results by device dependent random variables. For the two detectors 1 and 2, we 
define two axis dependent families of random variables D^p : Q\ t-»- Q' 1D and Dj^ff : ^2 <-> f^2- 
The outcomes of these random variables are Q'm = ^2D — {1\4-}' Since the random variables 
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Djip and Dj2P depend on the setting of the Stern-Gerlach magnets, we write their outcomes on 
the state space using the axis dependent notation aip and 5"2p. 

The results at the detectors could depend on the preparation of the particles by the source. 
We denote the set of outcomes at the source as fls an d say that the corresponding events happen 
at preparation stage. Since we have a set of outcomes at the source and at the detectors, we 
must describe the EPR experiment with an enlarged probability space. We denote the outcomes 
of that space as Q = fiio x Qs x ^2D and define a sigma algebra T on that set by the power set 
T = V (fl). The probability measure of the enlarged probability space will be denoted by P. 

On this enlarged probability space, we introduce a familiy of random variables 

(pip ® 4>2V ■ fi >-»■ fi'lD x fl' 2D (1) 
whose definition is 



{(pip ® <p2v) (u>l) = D n p (Win) (g> Dj2S (war)) (2) 

with £ (! and wid G Old, and W2D G £^20- 

The random variable <g) <p 2 v gives information on the joint outcomes that happen simul- 
taneously at each measurement station with respect to events on the enlarged space. To make 
contact with Nelson's notation, we define the notations a 1 p = &ip x 0! 2D , (Tip =t=t X ^2D> an d 
(Tip =|=| The events {(p 1 p ® <p2V G (Tip =1} and {<p ltl ® <p 2 a G (Tip =f} contain informa- 

tions on the outcomes at detector 1 for axis p. We let the set of these events for all axes jl at 
1 generate a sigma algebra !Fi. Then, T\ contains all information about the events that might 
happen at 1. 

Similarly, we define the notations aiv = f^ D x 02,7, <r 2 a =t= Q'w x ti aQ d C2S =4-— SI'id x i- 
The events {4>ip ® 4>2S G (t^s =i} and {<^>i,j ® (p2ff G (X23 =f} contain the information on the out- 
comes at detector 2. We let the set of these events for all axes v at 2 generate the sigma algebra 
T*. 

The probabilities of the events that are generated by 4>lp ® <p2P will be denoted by 

P ({(pip <8> <p2v G [a x p P| CT2iA }) = P^jjj^^j, (cr^ P| a 2 pj . (3) 

The expression P,^-®^ defines an axis dependent family of probability measures for events of 
the form a 1 p |"| 02P, where 

Since the family of random variables (pip ® (p2v is defined on the enlarged probability space, it 
will give us the possibility to investigate the relationship between the outcomes at the detectors 
and the ones at preparation stage. We let a third sigma algebra Fs contain the information about 
the outcomes at the source. Accordingly, Ts is generated by the power set Q.id x V(Qs) x &2D- 

We can compute the conditional probability of the events at the detectors given the events in 
J-"s that happen at preparation stage with: 

P(A\Ts)=EX[l A \T s ] (4) 

Eq. ([4]) is the definition of a random variable for outcomes in the enlarged probability space. It 
can be interpreted as the revised probability of an event A to happen with respect to the extra 
information about which events in Ts occur for a given outcome. Obviously, P ( A\ Ts) fulfils the 
following properties: 

• For all events A G T, the random variable P (A\ J\s) is measurable with respect to J-s- 

• We have < P ( A\ Ts) < 1 for all events A G J- and outcomes on the enlarged probability 
space. If A is the sure event ilm x Hi x SI2 x S! 2 d, the conditional probability P ( A\ is 
unity for all outcomes on the enlarged probability space. In case A is the impossible event 
0, the random variable P (A\ J-s) is zero for all outcomes. 

• For every sequence of pairwise disjoint events (A n ) n( - ti G J-, the equality P (U^Li An\ J~s) = 

>;: : 1 ?s) holds. 

• The unconditional probability of any event A € J 7 can be computed with the expectation 
value EX[P (A\F S )] = fP(A\F s ) dP = P (A). 
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• For any event A £ T and another event Ais £ J~s, where Ais 7^ 0, it follows from Eq. 
© that P {Af]Ai S \Ts) = P(A \ T S ) for outcomes in A 1S , and P ( A f| A 1S \ T s ) = for 
outcomes not in Ais. 

In case that the events on the enlarged probability space are generated by (j>ifi ® 4>2S, we will 
use the following notation: 

P '<t>lit®<t>2i> i^fi X ^2£f| ^S) = P ({<£l£ ® ^2,7 £ x ^2v)}\ Ts) ■ (5) 

In the analysis of the EPR experiment, we will often have to deal with equivalent events. Two 
events A and B in T are equivalent if 



This means that if A happens, then B also happens and vice versa. This does not mean, however, 
that A causes B to happen or vice-versa, if A and B are equivalent to each other, then their 
complements (A) c and (B) c must also be equivalent to each other. Furthermore, one can show 
that if the event A is equivalent the event B, and B is equivalent to another event C, then A is 
also equivalent to C. Furthermore, 

Now, all the mathematical structures needed to analyse the EPR experiment have been defined 
and we can proceed with the necessary locality conditions. 

2.2 Active locality 

At first, Nelson defined two different forms of locality: Active locality and passive locality. The 
meaning of active locality is: 

Whatever axes the experimenter selects at one measurement device, e. g. at 2, it does not 
change the outcomes at 1, as long as 1 does not lie in the future cone of 2. Active locality is a 
requirement from the theory of relativity. We can mathematically define active locality as follows: 

The Stern Gerlach magnets may influence the outcomes of the experiment as a function of their 
axes. The corresponding events are generated by the axis dependent random variables <f>ip <g> (j>2p. 
Now we let the area of measurement station 1 be disjoint from the future cone of station 2. With 
the axis fl of 1 being left constant, different axes v 7^ v' of Stern-Gerlach magnet 2 are chosen. We 
call a theory actively local if the events {4>iji ® 4>2ff £ Pip} £ J~i and {<j>iji 53 4>2ff' £ £ ^1 are 
equivalent. That is 

P {{<plp, ® 4>2,7 £ Cr-Ljt} Pi {fajj <g) faff' £ <Tl/z}) = P <f>lfl®<i>2,7 (°1m) = F <Pir^20' • ( 7 ) 

Eq. implies that a change of the setting at station 2 does not modify the outcome at station 1. 
Note that from Eq. ©, only 

P^i/j®^? = p ^i^®</» 2 p' (8) 

is actually required by the quantum mechanical probabilities of quantum mechanics. 

A similar condition should hold for the events {(pip CS> 4>2v £ o"2p} £ Fi and {4>ift' ® 4>2a £ o"2p} £ 
J~2 at Stern-Gerlach magnet 2, provided that the spatial region of 2 is disjoint from the future 
cone of magnet 1: 



In Eq. ((9J, we have ft' 7^ fl for the axes of detector 1, with the axis v of magnet 2 being left 
constant. 

By Eq. (5J, we can condition the probabilities of {<^>i/j §3 4>2P £ &iji} £ J~i an d {4>iji ® 4>2S £ &2a} £ 
T2 with respect to Ts and we get 




(6) 




■2H {°2v) = P ( {4>ljl ® 4>2v £ 



a 2u\ P| ® ^2? £ CT2u}) = ~P^i0®^0 ( a 2C>) • (9) 



(10) 

(11) 
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It is important to note that Eq. describes the probabilities which are given by quantum 
mechanics. This equation forbids in any theory that describes the quantum mechanics probabil- 
ities to send instantaneous signals to spacelike separated locations. In contrast Eqs. Q and ([9]) 
are not given by quantum mechanics and are violated by many hidden variable theories, such as 
Bohmian mechanics. 

Because quantum mechanics does not contain Eqs. © and Q, it does not forbid an explanation 
of its outcomes with a model that violates causality. Someone who strictly insists on the relativity 
principle could therefore view quantum mechanics as an incomplete theory. 

Furthermore, one should note that, when describing an event by P^-.®^, (cri/z f] oiv), one can 
not conclude from active locality alone that we would have 

p 0iM®^2,? (^imP) " 217 ) = p 4v®<^' (°v rV 2 - 7 ') • ( 12 ) 

The reason for this is that the event {4>ip ® <\>2H G oip) G J~i must not be equivalent to the 
event {(f>i0 ® <$>io G a ifi'} G -Fx- Similarly {(pip ® (f>2S G 0"2p} G .F2 must not be equaivalen to 
{01m ® 02i" G ""is'} G J-2- The physical reason for this is that spin operators, like all angular 
momentum operators in quantum mechanics, do not commute. 



2.3 Passive locality 

In this subsection, we consider the conditional joint probability of {(pip <8> 4>2a G (Tip} G Ti and 
{4>ip®(p2V G 0-2,7} G J-2 with respect to Ts- It gives information about events which happen 
simultaneously at the spatially separated locations 1 and 2. We say that passive locality holds if 

p 0i f -i®<te ( °V P| ^ j = P ^i f t8>02,? (°"imI ^s) P<t>iiz®<i>2i> i°'20\J r s) , (13) 

for every pair of axes /2 and v. The outcomes of an experiment may be statistically dependent. 
Such a dependence may arise because of a preparation stage. The condition of passive locality says 
that the outcomes should be conditionally independent given the information about preparation 
stage Ts- A violation of passive locality would mean that the dependence of the outcomes does 
not originate at Ts- It is possible to have active locality without passive locality. Furthermore, a 
theory that violates passive locality does not have to incorporate any non-local interaction between 
the spatially separated measurement stations 1 and 2. However, the true physical implications of 
the passive locality condition will be postponed at the moment. 

2.4 Nelson's theorem 

With the definitions above, quantum mechanics fulfils the following properties: 

p M ®^2 t r (<7i/z =t n °w =+) + p <^®^ f t (^im =4- n a w =t) = !> ( m ) 



|E v) - E (p, v) + E (ft, i?) - E (//, v) j > 2. (15) 
Equation (|14p implies that if the axes of the Stern-Gerlach magnets are the same, the spin values 
measured at 1 and 2 are always opposite. Using P^agfoa ((Tip =t) + p 0i f t®<*2 f t ( a ip = 4-) = 1 an d 
P <£ia®02;i (°2m =t) + p <?>i^®02m (°"2m =40 = 1, we can rewrite Eq. (TJJ} in the following form: 

F ' <t>lH®<t>2[i =t) = P <#>1^®02£ (°"lM =t fl °"2m =4-) = P 01jI®02,I (°"2fl =4-) , 

+ + + (16) 

P 01m®02m ( CTl ^ = 4-) = P 01m®^2m = 4- n°"2M =t) = P 01^®02m (°" 2 M =t) • 

This makes clear that the events at the detectors are equivalent if the same axes at the separated 
measurement devices were chosen. 

The function E(/2, v) in Eq. (|15|l is called correlation coefficient. It is defined through: 

E (p, v) = P^ -®4> 2 - (crip =t P) 02s =t) + P*i ? ®0 2i 7 =4- Pi °" 2 ' 7 

- P<t>ifl®<t>2e [ ff xa =t fl " 2 " 7 ~ p <*i f t«02,? (°V =4- rV 2p • ( 17 ) 
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It was shown by Nelson that Eq. (|15[) is in conflict with theories where Eq. (|14[) and active as well 
as passive locality hold. Nelson's proof [9] [10] (with corrections in 11 ) proceeds as follows: 

We are interested in the properties of the conditional probabilities of the events {</>ip ® 4>2P £ <Jip} £ 
T\ and {4>\fi ® 4>2P £ 02a} £ given the sigma algebra Ts. At first, we will investigate, what 
the equivalence property of Eq. (fT6)) combined with the assumption of passive locality implies for 
these conditional probabilities. 

If the axes at the two detectors are the same, i. e. ft = u, we have, due to passive locality: 

P 0i,-i®02m (cvfV 2 ' 7 ! Ts ) = P *i*®*V (^imI-^sOP^®^ (ovI-Fs) • (18) 
Since < P^®^ (cr 2ft \ F s ) < 1, we get 

P 0im®^2m (^^D " 2 ^! • Fs ) - P ^i f t®^ OimI-^s) ■ (19) 
With the properties of the conditional probabilities stated in section 2.1, it follows that 

P <^®^ M = EX [P^e^p M -^)] (20) 
and, similarly, 

p*^®^ (^lMn "^) =ex [p^m®^,! (^lMn " 2 ^! • Fs )] ■ ( - 2i - ) 

Using Eq. (|16|) . we observe that an event {4>ip ® 4>2P £ at detector 1 implies an equivalent 

event {cj> lf i <g> 2 p £ 02P } at 2, i. e. 

P 4>1P®4-2P = P <^1m®^2A (^ImPI " 2 ^) = P ^l f t®<^2M (°" 2 m) • ( 22 ) 

Hence, the expectation values in Eqs. (|20l) and (|21[l are equal. Accordingly, we get with Eq. (|19[) : 
P ^i f -i®^2A (^imPI " 2 ' 7 ! = P^®^/» Oitfl-^s) , (23) 

and 

P 0i,-i®02 t t (^imPI " 2 ' 7 ! • 7r ' s ') = P<^®<% Oatfl-Fs). (24) 

Now, we will look at the consequences of active and passive locality for the conditional proba- 
bilities. Passive locality demands for events {(f>ip ® 4>iv £ crip =t} at 1 and {4>ip <8> (j>2v £ Giv =4} 
at 2 that: 

KP^^C^^I^s). (25) 

Active locality implies that the actions happening at 2 can only affect anything in the future 
cone of 2. Since 1 does not lie in that cone, nothing what happens in 2 can affect the outcomes 
measured at 1. Therefore, the events {4>iji ® 4>2H £ &ijt =t} and {4>ip O 02^ £ Ci£ =4} are equiv- 
alent: 

p 0i^cs02,? (°V =11 Fs) = P^®^ =tl-^s) = Pit- (26) 
An analogous expression is true for the events at 2: 

p 0i£®<Z>2P (°"2P =41 ^s) = p </>ip8>02P ( CT 2P =1! Fs) ■ (27) 

Without loss of generality, we can select an axis ft = v with v from Eq. (|27[) in the Eqs. (|16l) . 
(|23|) and (|24[) . It then follows from Eq. (|16[) that the event {</>itr ® (j>2H £ crip =4} at detector 2 in 
the right hand side of Eq. (I27[) implies an equivalent event of {4>ip ® 4>2H £ cw =t} at 1. Using 
the Eqs. ()23|) and (|24|) with an axis ft — v, we can conclude that 

p 0i^®02P (°"2P =41 .Fs) = p ^>ip®^>2P ( a 2p =4 Fs) 

= p 0i P csfe =tl ^s) = iV. (28) 
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Plugging Eq. ([26]) and Eq. (J28JI back to Eq. ([25]>. we arrive at 

P^it^a ( ct im =tf)<T20 =l\rs) = PpPp- (29) 

The events {(pip <S> <^>2i7 G =t} an d {<^i/i ® 0217 G f"2i7 =i} are disjoint and their union is the 
sure event. According to section 2.1, the sum of the conditional probabilities with respect to a 
sigma algebra is equal to unity for such events. Therefore, we get with Eq. (|28[) : 



P<t>ip®<t> 2 a (°"2P =tl Fs) = P^ip®0 2 p ( a 2P =t| Fs) — 1 — Pp- (30) 

Since active locality holds, the events {(j>ip ® (j>2P G fi M and {4>ip ® 4>2p G (Xi^ =i} are equiv- 
alent, and we get, since that the complement of {4>ip ® <$>2p G cr lM =4} and {0i/z <8> <j>2ji G 01^ 
is the sure event: 

P ^®^2s ( ipt =4-1 ^s) = p ^®^2/i ( a ift =41 Fs) 

= l-P* liS ®* 2it {viii=t\J r s) = l-Pp. (31) 
Due to passive locality and the Eqs. (131 [I and (|30[) . we have for the events {<f>ip ® 02£? G 0i£ =J,} 
and {^iji ® <j) 2 p G <x 2 c? =f} : 

P^fe (*i/J =lf> 2 -=^|- Fs ) = (1 - -Pm) (1 - ^) • (32) 
In the same way, we can derive the relations 



°ip =t f) <?2ff =t| J=s) = Pjt (1 - Pp) (33) 



p*m®* 2 * ( ffii* =4- n ^ =4-1 jr s ) = (i- Pp) p,y. (34) 



and 



Using the Eqs. ([33]), (J34J) , Q29J and l|32]l. we may define the function 

E(/?,i7|J- s ) = Pp (I - P?) + (I - Pp) P ff - PpP ff - (I - Pp) (I - P p ) . (35) 

The conditional probabilities in Eq. (|35|) are all in the range < P<p 1 p®t/> 2C ; ( °im H G2v \ -Ps) — 1- 
Therefore, the following inequality can be computed with four arbitrary axes ft, ft' and v, v' at the 
two stations 1 and 2: 

\E(ft,V\ F 8 )+E(fty\ F s ) + E(jt,P\ JFa) - E (ft 1 , v\ Fs) < 2. (36) 



If we replace the conditional probabilities in Eq. Q36[) by their corresponding unconditional proba- 
bility measures, we arrive at the correlation coefficient from Eq. (|15[) . The unconditional probabil- 
ities are also in the range of < P^a®^ (oi^ {~\&2v) < 1, and they are given by the expectation 
values of the conditional probabilities. Hence, an inequality analogous to Eq. (|36[) must be true 
for them: 

|E (ft, v)+E (ft, if) + E (ft', u) - E (ft', if) | < 2. (37) 



Equation (|37[) is called Clauser-Holt-Shimony-Horne inequality [8] . It is a version of the statement 
in Bell's first article [5] 

\E(ft,v)-E(ft,if) \ -E (v,v')< 1, (38) 



which can be similarly derived. Both Eq. (|37[) and Eq. (|38[) are violated in quantum mechanics and 
this violation was confirmed experimentally in 1982 20 . It should be notet that Jarret [21] arrived 
at a similar conclusion even though he did not formulate his article within rigorous probability 
theory. 
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3 Implications of passive locality 

We begin by analysing the consequences of passive locality. Faris showed in [13] that passive 
locality, when combined with relation (|14|l from quantum mechanics, immediately leads to another 
condition, which he calls "deterministic passive locality" . According to Faris, Deterministic passive 
locality is defined as follows: 

Let the events {fop <8 fop £ crip} £ Ji at 1 and {fop ® fop £ <J2p\ £ at 2 be equivalent 
with respect to P<i> 1 p®<i>2p an d passive locality hold. Then, there must be an event Ais £ J~s at 
preparation stage, which is equivalent to both {fop CS> fop £ a ip} £ J~i an d{^i/i ® fop £ < J 2p} £ 

In his contribution, Faris states that a similar result is presented by Redhead in |23] at pp. 
101-102. Redhead claims, it would have been discovered at first by Suppes and Zanotti 24 . The 
derivation below will follow closely the lines of Faris: 

As shown in section 2.4, it results from the equivalence property of Eq. I|16[) and the passive 
locality condition of Eq. ()13[) that 



P<t>ip®<t>2p y°ipf\°2p\ TsJ = P<i> iri ®j> 2 p(<?ip\J r s) (39) 
and, similarly, 

p 0i,-i®02 t t (°"iArfV 2 ' 7 | • Fs ) = P^ift®^ i^pl^s) ■ (40) 
Using the assumption of passive locality again, we get with Eq. p9[) and Eq. (I40|) : 

= P<t> ip (S4>2p (^lp\J r s)P^ liX ti)4>2p (vzftlFs) 

= {P^p^A^^s)) 2 ■ (41) 
Eq. (|41[) implies, that the random variable P<t, 1 p®<t> 2 p ( a ip\ 3~s) can only have the values 1 and 0. 
The union event of the outcomes in the enlarged probability space for which P^j^s^a (<Jip\ J~s) 
has the value 1 will be denoted by Ais ■ From this definition and Eq. (|41l) . it follows that 
Prt>ip^(p 2 p (°imI Ps) i s equal to the indicator function of Ais: 

P 4>ip(S<l>2p (vip\Fs) = 1a 1s - (42) 

According to section 2.1, P</> 1( j®4> 2tt (vip\ J~s) is measurable with respect to Ts- Therefore, we 
must have Ais £ Fs- 

Now, we recall the definition 

P^it^p {<npt\ ?s) = P({fop ® fop £ cnp}\ Fs) (43) 

from section 2.1. In Eq. (|20l) . the unconditional probability of {fop ® fop £ crip} is given by the 
expectation value 

P<P ip ®<f>2p(°ip) = EX[P^»^(£T^| Tjy)] (44) 
P ( {fop ® fop £ cr ip }\T s ) dP. 

Similarly, the probability of an event Ais on the enlarged probability space with its measure P 
can be computed by the expectation value of the indicator function 1a 1s '■ 



EX[U 1S ] =J U ls dP = P(A ls ). (45) 

Using the Eqs. (gSJ, (O, HU) and (1431) . we can conclude that 

P^r^p M = P (Ais) ■ (46) 
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In section 2.1, we also have learnt that in case of Ais 7^ 0, the conditional probability of the 
intersection between the two events {4>ip ® 02m £ Pip} an d Ais £ Fs is equal to 



P ( {01m ® 02# £ &xp}\ Fs) for outcomes in Ais, 
for outcomes not in Ais- 
On the other hand, we get with Ais = 0: 



(47) 



P ( {0im ® 02m £ ctim) Pi A is| -^sj = P ( 0| -Fs) = for all outcomes. (48) 
The Eqs. 07), (g3) and (g2) lead to the relation 

P ({0im ® 02m £ ^imIH a M = P^®^ (<M Fs) = Uis- (49) 



Due to Eq. (|48|l and the definition of Ais, Eq. (J49j also holds in case of Ais = 0- Computing the 
expectation value from both sides of Eq. (|49[) yields 

p ({fan 9 02M £ ^iM> n Ais ) = p ( Ais ) • ( 50 ) 

Eqs. (146 1) and (|50|l imply that the event {01m <8> 02m £ °"im} £ Fi is equivalent to an event 
Ais £ Fs- It follows from Eq. J39), Eq. ((40) and the second line of Eq. ((41) that 

p *i#®«20 -^s) = (P<* lf r®0 2ft (^2mI -^s)) 2 ■ (51) 
Using Eq. ((42) together with Eq. (|51|) . we arrive at 

P 0i^®<#>2m (^imI-^s) = P<?»i^®02m (°"2mI Fs) = 1a 1s . (52) 

If an event A £ is equivalent to an event B £ T and B is equivalent to another event C € F, 
then A is also equivalent to C. The events {fap, ® 02m £ °"imj e -^i; {0iA* ® 02m ^ °"2m} £ ^2 
and Ais £ are not only in their sub-sigma algebras but also in JF. For this reason, the event 
{01m ® 02m £ °"2mj £ F% is equivalent to the same event Ais £ Fs which {4>ift ® fap £ vip} £ Fi 
is equivalent to. 

So we have the result that passive locality together with the exact anti correlations of entangled 
spin states imply that deterministic passive locality holds. The latter states that the events at the 
detectors must be predetermined by another event at preparation stage. 



4 The relation between Bell's theorem and the Free 
Will theorem of Conway and Kochen 

In this section, we review a Proof of Bell's inequalities that is given by Faris in [13] . It is interesting, 
because it relates Bell's theorem to what Kochen and Conway call the Free Will theorem [27] . 

In the previous section, we have shown that the event {4>\a ® 02A ^ °\a = 1") f° r a P arr 

of axes A and A is equivalent to some event Ais £ Fs that is generated by the random variable 

Xaa : = V<t> lA o>4, 2A (e^lFs). 

The active locality conditions of ((7{ and @ then imply that 

p *aW 2 A ( ct ia =t) = p 4>i^ 2 a Ka =t) 

= P ({fa X ® fa A £ (a lA =t)} f| {Xaa e ^is}) 

= P(foie4is}) 

= p ({0ia ® 02B e Ka =t)} f) feifl G Ais}) 

= P({XAb £A 1S }) 

= P XAS (Ais). (53) 
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The events {Xab ^ Ais} happens at preparation stage. Therefore, we have to restrict the 
possible axis dependence of that event even more. If we take the active locality condition seriously, 
then, the event {Xab ^ Ais} should indeed be equivalent to an event {Xa'b' ^ Ais}, for arbitrary 
axes A' A and B' B. 

Here, the conflict with quantum mechanics arises. Since angular momentum operators do not 
commute, there can, at least in quantum mechanics, not be a spin outcome for axis A that is 
equivalent to another spin outcome for anarbitrary axis A' 7^ A at the same detector. 

In Faris proof from section 3, passive locality together with the exact anticorrelations result 
in determinism. Now determinism together with active locality says that even for a set of non 
compatible observables, there must be an event that determines the values of these observables 
for all times. We will now proceed with Faris Proof of Bell's theorem and show, that this is not 
possible. 

A similar argument as given above implies that the event {4>ig 8> <f ) 2B a 2B ~4} ^ T2 for an 
axis B is equivalent to an event B2S € Ts and we get with active locality: 

P<t> lS m 2 g Kb =4-) = P (i<t>i3 ® <t>iB e Kb =i)}f]{XBB G B is>) 

= p (Wia ® 4>2B e Kb =4)} D ixxa e Bis}) 

= P x^(B2s) 

= P XA , a , (Bus) • (54) 
Since ® 2 b e Kg =4-)} is the complement of {4>ig ® 2 b e Kb =t)}i we a l so have 



P*„W 2 e (Va =t f>2B =t) = P({XABG^is}f|{^BG(B25) C }) 

= P^ s {A ls f](B2 S f) 

= (^isf|(^s) C ), (55) 

where (B2s)° denotes the complement of £?2S- 

We can do this for three axes at the detectors A, B, C. 

P * ljf ®* 2 3 ( a lA =t fl^B =t) + P ^ lf5 ®0 2 c ( ct ib =t fl^c =t) 

+P ^1C^2A (°"lC =t D CT 2A =t) 

= Pxa>b> (Axs f| (B 2S f) + PXS'C' (Bis f| (^s) C ) 

+PXc^ (c ls f|(A 2S ) c ) (56) 

The last equation contains the axes A', B' , C . These axes are arbitrary and by active locality 
the computed events at preparation stage are equivalent for all pairs of axes. Therefore, we can 
set A' = B' = C. 

P^ia^^b ( ct ia =t D a 2B =t) + P <t> lS ®<i> 2<3 ( ct ib =t Pi a 2c5 =t) 

+ P ^1C®*>2A (°"lC =t PKa =t) 

= PXX'X' D ( B ^f) + F Xa>a> (Bis fl (Cas)°) 

+PXA>A> (Cisp\(A2 S f) (57) 

For the event (A2s) C , there is an equivalent event {(j> 1A ~ ® 4>2A ^ ( a 2A = t)} a t the detectors. Now 
this event is, by Eq. (|16l) . equivalent to {4>ia® ( t > 2A ^ Ka = 4-)}' which, by active and passive 
locality, must be equivalent to the event (Ais) C £ J~s- Making the substitutions with the events 
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{Ms) C , (B 2 s) C , and (C 2 s) C , we arrive at: 

p *ia®*2b { a iA =t =t) + Fv lf5 «^2c (°"is =t n^a =T. 



p *^ on^) c ) + p *^ (^n^) c 



< 1 (58) 



In the last step, we have used that the events whose probabilities are computed are exclusive. 

Now, one can imagine a theory, where the events at the detectors do not depend on the 
measurement devices at all. In such a theory, we would have 



= P *iX®* 2j J (^lA =t PKs =t) + P * ljr ®* 2J t (°"lS =t PKc =t) 



(59) 

Since no outcome is assumed to depend on an axis, we can substitute the events at 2 by their 
equivalent events at 1: 



=t n^s =t) + p<* li5 ®0 2( 3 (o"is =t n^c =t 



< 1 (60) 

And we arrive again at Bell's inequality. 

Hence, there can be no theory for the singlett state at all, where the outcomes do not depend 
on the settings of the detectors. 

There may be theories for the singlett state, where an event at preparation stage determines 
the outcomes at the detectors. This preparation event then would have to depend on the settings 
of the measurement devices but these divices can later be chosen by the experimenters at will. 

Recently, Conway and Kochen discovered this result independently and called it Free- Will 
theorem. If active locality holds and the experimenters can choose their axis freely, the outcomes 
can not be pre-determined by a preparation event. As a result, the particles must behave randomly 
[27] . (From Nelson's writings on Bell's theorem, it seems that Nelson knew this since 1984. 
Apparently, Nelson's writings were unknown even to Kochen, who is in the same mathematics 
department at Princeton.) 

The author of this note believes that experimenters can choose their axis freely and active 
locality holds. Therefore, and because non-local theories can never be written manifestly covariant, 
the author thinks that non-local models of quantum mechanics should not be pursued. 

On his article, Faris summarizes his proof of Bell's theorem as follows: 

The logic of the proof is elementary but subtle, [...] The first part of the argument is the 
observation that when the two magnetic field gradients are taken in opposite direction, 
the results exactly coincide. The deterministic passive locality assumption says that 
this implies that the randomness must have been introduced at the time the particles 
were prepared, since a later source of randomness would spoil the coincidence. Thus, 
with the magnetic field gradient in opposite direction, each spin result is determined by 
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something that happened at the preparation stage. The spins may also be measured in 
the situation when the magnetic field gradient is not in the opposite direction to the 
magnetic field gradient for the other particle. According to active locality, the results 
of spin measurements should not depend on the direction used for the measurement of 
the other particle. Therefore, for the magnetic field gradients in arbitraty directions, 
the spin results for each particle are still determined by something that happened at 
the preparation stage. So, the magnets are not responsible for the randomness, it must 
be intrinsic to the system of two particles. In case of magnetic field gradients that are 
not the same or opposite, the dependence is strong but not perfect. This dependence 
is entirely due to something that happened at the preparation stage. The magnet 
configurations do not affect the probabilities. In this situation, [ Eq. 1581 ] says that the 
dependence is strong enough so that there can't be an intrinsic overall outcome that 
determines the result of each measurement. 



5 Conclusions 

The analysis of Nelson and Faris make clear that Bell's theorem does not require a theory that 
describes the quantum mechanical correlations with random variables to be non-local. In the 
words of a recent article of Nelson 25 : 

In quantum mechanics, if there are two dynamically uncoupled systems, an alteration 
of the second system in no way affects the first, even if the two systems are entangled. 

However, in order to maintain locality, a theory must give up determinism, since latter is a conse- 
quence of passive locality. 
In the words of Faris: 

A violation of active locality would be upsetting, since it would mean that an active 
intervention at one point could influence the outcome at distant points. However, a 
violation of passive locality would only mean that dependence between simultaneous 
events at distant locations need have no explanation in terms of prior events. This is 
not clearly ruled out, but it is not evident how to construct such a theory. 

Nelson's analysis of Bell's theorem also shows that, unfortunately, quantum mechanics contains 
no principle which actually forbids non-local theories. This is not what one should expect from 
a theory that can be written covariantly, and it is the reason for the interpretational problems of 
quantum mechanics. To construct an (actively) local theory of random variables that produces the 
quantum mechanical outcomes is therefore a reasonable scientific program. Only if we have such 
a model, we are able to exclude the various non-local and therefore non-physical interpretations 
of quantum theory. 

Nelson's stochastic mechanics could, for example, be developed into a local field theory that vi- 
olates passive locality for bosons [9] \W\ , including the electromagnetic field [26] . It was possible to 
quantise the linearized gravitational field from it [22]. Perhaps, a local theory of random variables 
that underlies quantum mechanics could be applied to field theoretical problems in strong gravi- 
tational fields, where ordinary quantum mechanics is not applicable. This is certainly whortwile 
to investigate. 

6 Relation between the assumptions behind Bell's in- 
equality and the locality principle of Einstein 

Finally, it is interesting to ask whether EPR would have called theories that violate passive locality 
to be complete. In their article pQ, they write: 

If, without in any way disturbing a sytem, we can predict with certainty the value of a 
physical quantity, then, there exists an element of reality corresponding to this physical 
quantity. 
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As we have shown in section 3, this reality criterion can not be applied to any theory that 
violates the passive locality condition of Eq. (|13[l . Furthermore, as there can be no device inde- 
pendent model for the singlett state at all, even in a deterministic theory, the experimenter at 2 
can not predict the outcome at 1, as he does not know the setting at 1, on which the outcome at 
1 must depend. In their article, EPR do not write whether they call a theory, where one is unable 
to predict the outcomes with certainty, as realistic or not. 

Einstein seemed to notice that the reality condition in [I] would be problematic. In a letter 
to Schroedinger, [28], Einstein complained that the main point which he wanted to make would 
have been "buried by erudition". Einstein then wrote own articles on his views about quantum 
mechanics. In his own writings, the reality criterion becomes a quite different one. He writes 1948 
in a letter to Born: 

I just want to explain what I mean when I say that we should try to hold on to physical 
reality. We are, to be sure, all of us aware of the situation regarding what will turn 
out to be the basic foundational concepts in physics: the point-mass or the particle is 
surely not among them; the field, in the Faraday-Maxwell sense, might be, but not with 
certainty. But that which we conceive as existing (real) should somehow be localized in 
time and space. That is, the real in one part of space, A, should (in theory) somehow 
exist independently of that which is thought of as real in another part of space, B. 
If a physical system stretches over the parts of space A and B, then what is present 
in B should somehow have an existence independent of what is present in A. What is 
actually present in B should thus not depend upon the type of measurement carried 
out in the part of space, A; it should also be independent of whether or not, after all, 
a measurement is made in A. If one adheres to this program, then one can hardly view 
the quantum-theoretical description as a complete representation of the physically real. 
If one attempts, nevertheless, so to view it, then one must assume that the physically 
real in B undergoes a sudden change because of a measurement in A. 

Clearly, the notion of "independence" here is not in the sense of statistical independence, or the 
"conditional independence" that is described by the passive locality condition of Eq. (Q3J. What 
Einstein meant is more in the sense of the active locality condition 

P ® 4>20 £ cri^} P| {010 ® (p 2 p> € o-i^} ) = P^-®^ (a lj2 ) = P<t> lit ®<i> 2S!l (o"im) , (61) 

which does not have to be violated in a random variable model of the singlett state. Einstein 
wrote the letter quoted above in 1948. It took time until 1986, when Edward Nelson discovered 
that exactly this locality principle which Einstein wanted to implement into quantum mechanics, 
was not forbidden by any theorem. On the contrary, since quantum mechanics just requires 

P 0ijr8x/>2P = ~P<t>ifl<8<t> 2 a> ( 62 ) 

, it does not say anything on whether active locality holds or not. So the construction of a model 
that contains this locality principle is still an open problem. 
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